Learning What's Easy: Fully Differentiable Neural Easy-First Taggers

نویسندگان

  • André F. T. Martins
  • Julia Kreutzer
چکیده

We introduce a novel neural easy-first decoder that learns to solve sequence tagging tasks in a flexible order. In contrast to previous easy-first decoders, our models are end-to-end differentiable. The decoder iteratively updates a “sketch” of the predictions over the sequence. At its core is an attention mechanism that controls which parts of the input are strategically the best to process next. We present a new constrained softmax transformation that ensures the same cumulative attention to every word, and show how to efficiently evaluate and backpropagate over it. Our models compare favourably to BILSTM taggers on three sequence tagging tasks.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Two Novel Learning Algorithms for CMAC Neural Network Based on Changeable Learning Rate

Cerebellar Model Articulation Controller Neural Network is a computational model of cerebellum which acts as a lookup table. The advantages of CMAC are fast learning convergence, and capability of mapping nonlinear functions due to its local generalization of weight updating, single structure and easy processing. In the training phase, the disadvantage of some CMAC models is unstable phenomenon...

متن کامل

Auto-Differentiating Linear Algebra

Development systems for deep learning, such as Theano, Torch, TensorFlow, or MXNet, are easyto-use tools for creating complex neural network models. Since gradient computations are automatically baked in, and execution is mapped to high performance hardware, these models can be trained endto-end on large amounts of data. However, it is currently not easy to implement many basic machine learning...

متن کامل

FlexTag: A Highly Flexible PoS Tagging Framework

We present FlexTag, a highly flexible PoS tagging framework. In contrast to monolithic implementations that can only be retrained but not adapted otherwise, FlexTag enables users to modify the feature space and the classification algorithm. We categorize existing PoS tagger implementations into one of three categories with regards to model-training capabilities and the level of access those imp...

متن کامل

Numerical Coordinate Regression with Convolutional Neural Networks

We study deep learning approaches to inferring numerical coordinates for points of interest in an input image. Existing convolutional neural network-based solutions to this problem either take a heatmap matching approach or regress to coordinates with a fully connected output layer. Neither of these approaches is ideal, since the former is not entirely differentiable, and the latter lacks inher...

متن کامل

APPLICATION OF NEURAL NETWORK IN EVALUATION OF SEISMIC CAPACITY FOR STEEL STRUCTURES UNDER CRITICAL SUCCESSIVE EARTHQUAKES

Depending on the tectonic activities, most buildings subject to multiple earthquakes, while a single design earthquake is suggested in most seismic design codes. Perhaps, the lack of easy assessment to second shock information and sometimes use of inappropriate methods in estimating these features cause successive earthquakes mainly were ignored in the analysis procedure. In order to overcome t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017